Buy one get one free: Distant annotation of Chinese tense, event type and modality

نویسندگان

  • Nianwen Xue
  • Yuchen Zhang
چکیده

We describe a “distant annotation” method where we mark up the semantic tense, event type, and modality of Chinese events via a word-aligned parallel corpus. We first map Chinese verbs to their English counterparts via word alignment, and then annotate the resulting English text spans with coarse-grained categories for semantic tense, event type, and modality that we believe apply to both English and Chinese. Because English has richer morpho-syntactic indicators for semantic tense, event type and modality than Chinese, our intuition is that this distant annotation approach will yield more consistent annotation than if we annotate the Chinese side directly. We report experimental results that show stable annotation agreement statistics and that event type and modality have significant influence on tense prediction. We also report the size of the annotated corpus that we have obtained, and how different domains impact annotation consistency.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Distant annotation of Chinese tense and modality

In this paper we describe a “distant annotation” method by which we mark up tense and modality of Chinese eventualities via a wordaligned parallel corpus. We first map Chinese verbs to their English counterpart via word alignment, and then annotate the resulting English text spans with coarse-grained tense and modality categories that we believe apply to both English and Chinese. Because Englis...

متن کامل

Automatic Inference of the Tense of Chinese Events Using Implicit Linguistic Information

We address the problem of automatically inferring the tense of events in Chinese text. We use a new corpus annotated with Chinese semantic tense information and other implicit Chinese linguistic information using a “distant annotation” method. We propose three improvements over a relatively strong baseline method – a statistical learning method with extensive feature engineering. First, we add ...

متن کامل

Annotating Event Mentions in Text with Modality, Focus, and Source Information

Many natural language processing tasks, including information extraction, question answering and recognizing textual entailment, require analysis of the polarity, focus of polarity, tense, aspect, mood and source of the event mentions in a text in addition to its predicateargument structure analysis. We refer to modality, polarity and other associated information as extended modality. In this p...

متن کامل

Tense, Time, and Adverbs in Italian Sign Language

Sandro Zucchi Dipartimento di Scienze della Comunicazione Università di Salerno One difference between the Italian Sign Language sentences in (1)-(3) below and their English translations is that, while the English predicates are inflected for tense, the sign for the verb in Italian Sign Language (LIS) appears in its citational form: (1) GIANNI HOUSE BUY “Gianni is buying a house” (2) TIME-AGO G...

متن کامل

How And Where Do People Fail With Time: Temporal Reference Mapping Annotation By Chinese And English Bilinguals

This work reports on three human tense annotation experiments for Chinese verbs in Chinese-to-English translation scenarios. The results show that inter-annotator agreement increases as the context of the verb under the annotation becomes increasingly specified, i.e. as the context moves from the situation in which the target English sentence is unknown to the situation in which the target lexi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014